Search CORE

39 research outputs found

A fast algorithm to initialize cluster centroids in fuzzy clustering applications

Author: Cebeci Cagatay
Cebeci Zeynel
Publication venue: 'MDPI AG'
Publication date: 15/09/2020
Field of study

The goal of partitioning clustering analysis is to divide a dataset into a predetermined number of homogeneous clusters. The quality of final clusters from a prototype-based partitioning algorithm is highly affected by the initially chosen centroids. In this paper, we propose the InoFrep, a novel data-dependent initialization algorithm for improving computational efficiency and robustness in prototype-based hard and fuzzy clustering. The InoFrep is a single-pass algorithm using the frequency polygon data of the feature with the highest peaks count in a dataset. By using the Fuzzy C-means (FCM) clustering algorithm, we empirically compare the performance of the InoFrep on one synthetic and six real datasets to those of two common initialization methods: Random sampling of data points and K-means++. Our results show that the InoFrep algorithm significantly reduces the number of iterations and the computing time required by the FCM algorithm. Additionally, it can be applied to multidimensional large datasets because of its shorter initialization time and independence from dimensionality due to working with only one feature with the highest number of peaks

University of Strathclyde Institutional Repository

Comparison of K-Means and Fuzzy C-Means Algorithms on Different Cluster Structures

Author: Cebeci Zeynel
Yildiz Figen
Publication venue: 'Journal of Agricultural Informatics'
Publication date: 01/01/2015
Field of study

Repository of the Academy's Library

Efficiency of Random Sampling Based Data Size Reduction on Computing Time and Validity of Clustering in Data Mining

Author: Cebeci Zeynel
Yildiz Figen
Publication venue: 'Journal of Agricultural Informatics'
Publication date: 01/01/2016
Field of study

Repository of the Academy's Library

Two novel outlier detection approaches based on unsupervised possibilistic and fuzzy clustering

Author: Bayyurt Lutfi
Cebeci Cagatay
Cebeci Zeynel
Tahtali Yalcin
Publication venue: 'PeerJ'
Publication date: 27/09/2022
Field of study

Outliers are data points that significantly deviate from other data points in a data set because of different mechanisms or unusual processes. Outlier detection is one of the intensively studied research topics for identification of novelties, frauds, anomalies, deviations or exceptions in addition to its use for data cleansing in data science. In this study, we propose two novel outlier detection approaches using the typicality degrees which are the partitioning result of unsupervised possibilistic clustering algorithms. The proposed approaches are based on finding the atypical data points below a predefined threshold value, a possibilistic level for evaluating a point as an outlier. The experiments on the synthetic and real data sets showed that the proposed approaches can be successfully used to detect outliers without considering the structure and distribution of the features in multidimensional data sets

University of Strathclyde Institutional Repository

PubMed Central

Unsupervised Discretization of Continuous Variables in a Chicken Egg Quality Traits Dataset

Author: Cebeci Zeynel
Cebeci Zeynel
Yıldız Figen
Publication venue: Figen Yıldız
Publication date: 01/01/2017
Field of study

Discretization is a data pre-processing task transforming continuous variables into discrete ones in order to apply some data mining algorithms such as association rules extraction and classification trees. In this study we empirically compared the performances of equal width intervals (EWI), equal frequency intervals (EFI) and Kmeans clustering (KMC) methods to discretize 14 continuous variables in a chicken egg quality traits dataset. We revealed that these unsupervised discretization methods can decrease the training error rates and increase the test accuracies of the classification tree models. By comparing the training errors and test accuracies of the model applied with C5.0 classification tree algorithm we also found that EWI, EFI and KMC methods produced the more or less similar results. Among the rules used for estimating the number of intervals, the Rice rule gave the best result with EWI but not with EFI. It was also found that Freedman-Diaconis rule with EFI and Doane rule with EFI and EWI slightly performed better than the other rule

Siirt University Institutional Repository

Ceylanpınar Tarım İşletmesi siyah-alaca sığır populasyonu üzerinde bir çalışma

Author: Cebeci Zeynel
Publication venue: Çukurova Üniversitesi, Fen Bilimleri Enstitüsü
Publication date: 01/01/1985
Field of study

TEZ267Tez (Yüksek Lisans) -- Çukurova Üniversitesi, Adana, 1985.Kaynakça (s. 70-72) var.74 s. ; 29 cm.

Çukurova University Institutional Repository

Tarımda Yapay Zeka: R'de Derin Öğrenme İle Hızlı Bir Adım

Author: Cebeci Zeynel
Publication venue
Publication date: 01/01/2019
Field of study

Büyük ölçüde doğrusal olmayan veri temsilini modellemek için kullanılan bir makine öğrenmesi alanı olan derin öğrenme son yıllarda büyük bir ivme kazanarak hemen endüstriyel alanı hızla etkilemektedir. Genelde ses, görüntü ve metin tanımaya dayalı sürücüsüz araçlar, doğal dil işleme, kanser hücrelerinin tanınması gibi birçok alanda uygulanmaya başlanmıştır. Derin öğrenmenin dayandığı algoritmalar çok uzun zamandan beri biliniyor olsa da uygulamaya aktarılması oldukça yenidir. Yüksek işlem ve depolama gücüne sahip bilgisayar ve sistemlerin geliştirilmesi derin öğrenme konusunda çalışmaların sayısında önemli bir artışa neden olmuştur. Parametrik ve non-parametrik istatistik analiz, doğrusal ve doğrusal olmayan modelleme çalışmaları, zaman serisi analizleri, sınıflama ve kümeleme uygulamaları, mekânsal bilişim ve daha başka alanlarda çeşitli istatistiksel ve grafiksel teknik ve yöntemlerin uygulamalarını içeren onaltı bin civarında paketi ile R zengin bir özgür ve açık kaynak yazılımdır. R hemen her alandaki çözümlere ilaveten son yıl birkaç yıl içinde keras ve tensorflow paketlerinin geliştirilerek dağıtılması nedeniyle derin öğrenme alanında da tercih edilen bir uygulama ve geliştirme ortamı olmaya başlamıştır. Bu bildiride R'nin derin öğrenme uygulamalarında kullanımı küçük bir süsen türleri görüntü tabanı üzerinde tanıtılarak R kullanan veri bilimcileri ve mühendisleri için sağladığı olanaklar gösterilmektedir

Çukurova University Institutional Repository

fcvalid: An R Package for Internal Validation of Probabilistic and Possibilistic Clustering

Author: Zeynel Cebeci
Publication venue: 'Sakarya University Journal of Computer and Information Sciences'
Publication date: 01/04/2020
Field of study

In exploratory data analysis and machine learning, partitioning clustering is a frequently used unsupervised learning technique for finding the meaningful patterns in numeric datasets. Clustering aims to identify and classify the objects or the cases in datasets in practice. The clustering quality or the performance of a clustering algorithm is generally evaluated by using the internal validity indices. In this study, an R package named 'fcvalid' is introduced for validation of fuzzy and possibilistic clustering results. The package implements a broad collection of the internal indices which have been proposed to validate the results of fuzzy clustering algorithms. Additionally, the options to compute the generalized and extended versions of the fuzzy internal indices for validation of the possibilistic clustering are also included in the package

Crossref

Directory of Open Access Journals

Süt sığırcılığında damızlık seçiminde en iyi doğrusal yansız tahmin(Best Linear Unbiased Prediction) yöntemi, yönteme ilişkin bilgi işlem algoritmaları ve Ceylanpınar Tarım İşletmesi siyah- alaca sığır populasyonuna uygulanması

Author: Cebeci Zeynel
Publication venue: Çukurova Üniversitesi, Fen Bilimleri Enstitüsü
Publication date: 01/01/1990
Field of study

TEZ868Tez (Doktora) -- Çukurova Üniversitesi, Adana, 1990.Kaynakça (s. 158-165) var.167 [8] s. ; 30 cm.

Çukurova University Institutional Repository

A Model Study For Establishing A National E-University In Turkey

Author: Zeynel Cebeci
Publication venue
Publication date: 01/01/2004
Field of study

E-Üniversiteler, eğitim-öğretim amacıyla kullanılan fiziksel binaları bulunmayan; gerçek üniversitelerin sanal modelleri olarak eğitim-öğretim ve yönetim hizmetlerini başta Internet olmak üzere ağ aracılığıyla tümleşik biçimde sağlayan örgütlenmelerdir. Sunulan hizmetler, farklı biçimlerde çevrim-içi öğrenme içerikleri, uzmanlaşmış sanal merkezler, kütüphane ve yönetsel işlevler, eşzamanlı ve eşzamansız etkileşimli araçlar, çevrim-içi birlikte çalışma ortamlarını (çalışma grupları, çevrim-içi dersler, seminerler ve uygulamalar, tartışma forumları, sesli ve görüntülü konferanslar vb) kapsamaktadır. Bu çalışmada, çeşitli ülkelerdeki e-üniversite uygulamaları kısaca tanıtılmakta ve Türkiye için önerilen bir e-üniversite modeli önerilerek örgütlenme ve işleyiş açılarından tartışılmaktadır.E-universities are the virtual organizations that provide the integrated educational services of student management and teaching on the computer networks, especially on the Internet, without having physical buildings and/or campuses. Offered services and facilities are consist of online learning materials in different formats, virtual centers, electronic libraries, management software, syncronous and asyncronous interactive tools, online collaboration environments (working groups, online course, seminars and practice works, discussion forums, audio-visual conferences etc.). In this study, some e-university models and approaches in many countries are introduced, then a countrywide e-university model which can be established in Turkey are discussed on the aspects of organization and working system, briefly

Çukurova University Institutional Repository